T. molitor winter larvae 



Native " Tm 12.86 " 
Purification of 
Type III AFP 



mRNA Isolation 
encoding Tm 12.86 



Generation of 
Antiserum 

against 
Tm 12.86 



Activator 
Partial 
Isolation 



Generation of cDNA libraries 







Screening cDNA libraries 







Planned 
Further 
Isolation 



Isolating and characterizing CLONES 



Tm 13.17 



2-2 2-3 3-4 



3-9 



7-5 



Used for RI ASSAY 
Development 

Detailing 

* Assay specificity 

* Assay sensitivity 
*Numerical quantitation 

-ice grain size 
-dilution profiles 
-regression analysis 
-RI Factor estimates 
-Slope comparisons 

*Reference comparisons 
to Tm 12.86 profiles 

*Mathematical modeling

*Automation 
-multiple samples 
-image analyses 

*Light Scattering method 
for assessing RI 

*Applications 



Recombinant

Proteins
Inactive



Generating His-Tagged 
Signal Plus and 
Signal Minus Clones 



Recombinant
Proteins
Inactive



Redirection towards 
Recombinant Proteins 
Isolated from Bacterial 
Inclusion Bodies 



Activity 



Quantified 



ANTIFREEZE ACTIVITY 
of Recombinant Peptides 



Capable of Further 



Enhancement



Fig. 1.0




Fig. 4.3



<Mp. 4.4 












Fig. 1.5



36.0 
35.8 
35.6 
35.4' 




3000 4000 6000 8000 10000 14000 18000 



Mass (m/z) 



Fig. 4.6




Fig. 1.7



NH2-L-T-D-E-Q-I-Q-K-R-N-K-I-S-K-E-?-Q-Q-V



44.0 
32.3 

17.4 
7.5 

1 2.5 5 10 15 20 H 







3.0 




[AFP] (mg/ml) 



1 2 3 4 5 6 




123456789 10 11 kD 



94 
67 

43 



30 

21 
14 




Fig. 2.3




lacP ^^^mcs^-^ 





A J 


bio 


IF 


alacZ 
CMV 
ColE1 
Neo' 



















1. Construct DNA Library

2. Isolate positive clone 



lacP -** 





A J 


bio 


IF 


CMV 
ColE1 
Neo' 


fl 

















3. Excise pBK-CMV phagemid 
containing cloned DNA insert 
by co-infection with helper phage 




DNA sequence of Tm 13.17 cDNA clone

Restriction sites marked: BamHI, EcoRI (5' end); XhoI (after the poly(A) tail)

1    [5' portion not legible in the scan]ATGAAGTTGCTCTGTTGTCTAATCT
     MKLLCCLIS
61   CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
     LILLVTVQA ^ LTEAQIEKLNK
121  AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC
     ISKKCQNESGVSQEIITKAR
181  GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA
     NGDWEDDPKLKRQVFCVARN
241  ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA
     AGLATESGEVVVDVLREKVR
301  GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA
     KVTDNDEETEKIINKCAVKR
361  GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
     DTVEETVFNTFKCVMKNKPK
421  AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
     FSPVD*
481  ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTCG
     polyadenylation signal (AATAAA); poly(A) tail (26)
537  AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC



< 



o o 
'Eh ^ 

a e 



CO 



O 
O 



o 

.2 2 

1 



1^ 

V ^ 
a w 



A. Mature Tm 13.17 amino acid residues

1   LTEAQIEKLN KISKKCQNES GVSQEIITKA RNGDWEDDPK LKRQVFCVAR
51  NAGLATESGE VVVDVLREKV RKVTDNDEET EKIINKCAVK RDTVEETVFN
101 TFKCVMKNKP KFSPVD

B. Summary of the composition analysis for the mature Tm 13.17 sequence:

Residue     Number   Mole Percent
A = Ala     6        5.172
B = Asx     0        0.000
C = Cys     4        3.448
D = Asp     8        6.897
E = Glu     13       11.207
F = Phe     4        3.448
G = Gly     4        3.448
H = His     0        0.000
I = Ile     6        5.172
K = Lys     16       13.793
L = Leu     5        4.310
M = Met     1        0.862
N = Asn     8        6.897
P = Pro     3        2.586
Q = Gln     4        3.448
R = Arg     6        5.172
S = Ser     5        4.310
T = Thr     8        6.897
V = Val     14       12.069
W = Trp     1        0.862
Y = Tyr     0        0.000
Z = Glx     0        0.000

Molecular weight = 13171.96; Residues = 116; Average Residue Weight = 113.551
Charge = 1; Isoelectric point = 7.74.
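The Number and Mole Percent columns of the composition summary can be recomputed directly from the mature sequence in part A. The following is a minimal sketch, not the program used to produce the figure; the sequence is transcribed from part A with OCR-ambiguous residues resolved against the composition table, and the function name `composition` is illustrative.

```python
from collections import Counter

# Mature Tm 13.17 sequence (116 residues), transcribed from part A of the figure.
# Assumption: OCR-ambiguous residues were resolved so that the counts agree
# with the composition table (e.g. 16 Lys, 6 Arg, 5 Leu).
MATURE_TM_13_17 = (
    "LTEAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVAR"
    "NAGLATESGEVVVDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFN"
    "TFKCVMKNKPKFSPVD"
)


def composition(seq):
    """Return {residue: (count, mole_percent)} for a one-letter protein sequence."""
    counts = Counter(seq)
    total = len(seq)
    return {aa: (n, round(100.0 * n / total, 3)) for aa, n in sorted(counts.items())}


if __name__ == "__main__":
    print(f"Residues = {len(MATURE_TM_13_17)}")
    for aa, (n, pct) in composition(MATURE_TM_13_17).items():
        print(f"{aa}  {n:3d}  {pct:7.3f}")
```

Residues absent from the sequence (His, Tyr) simply do not appear in the returned dictionary, which corresponds to the zero rows of the table.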



[Figure residue: pairwise nucleotide sequence alignment; the bases are not recoverable from the scan.]



Fig. 2.7



Tm 13.17    3  EAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVARNA   52
AFP-3       1  ETPREKLKQHSDACKAESGVSEESLNKVRNREEVDDPKLKEHAFCILKRA   50

Tm 13.17   53  GLATESGEVVVDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFNTF  102
AFP-3      51  GFIDASGEFQLDHIKTKFKENSEHPEKVDDLVAKCAVKKDTPQHSSADFF  100

Tm 13.17  103  KCVMKNKP  110
AFP-3     101  KCVHDNRS  108

Percent identity: 39.8 (identical amino acids); Percent similarity: 58.3 (identical amino acids plus conservative amino acids).
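The percent identity figure can be checked by counting identical residues over the aligned positions. The sketch below assumes the ungapped alignment shown in the figure (Tm 13.17 residues 3 to 110 against AFP-3 residues 1 to 108) and reads the last AFP-3 block as KCVHDNRS, as in the multiple alignment elsewhere in these drawings; the function name `percent_identity` is illustrative, and the similarity value is not reproduced because the scoring of conservative substitutions is not stated in the source.

```python
def percent_identity(a, b):
    """Percentage of aligned positions (gap positions excluded) with identical residues."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Aligned segments transcribed from the figure (assumption: OCR errors corrected).
TM_13_17 = (
    "EAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVARNA"
    "GLATESGEVVVDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFNTF"
    "KCVMKNKP"
)
AFP_3 = (
    "ETPREKLKQHSDACKAESGVSEESLNKVRNREEVDDPKLKEHAFCILKRA"
    "GFIDASGEFQLDHIKTKFKENSEHPEKVDDLVAKCAVKKDTPQHSSADFF"
    "KCVHDNRS"
)

if __name__ == "__main__":
    print(round(percent_identity(TM_13_17, AFP_3), 1))
```

With these transcriptions, 43 of the 108 aligned positions are identical.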
[Figure residue: boxed alignment of segments of Tm 13.17, B1 and AFP-3; the individual residue columns are not recoverable from the scan.]



2-2 



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC
M K L L L C F A F A A

ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
I V I G A Q A L T D E Q I Q K



AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
RNK I SKECQQVS GVS 

CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT 
QETI DKVRTGVLVDD 

CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCF SKKTG 

GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT 
D T F K C I Y D S K P D F S P

ATTGATTAATTGTTTTGTATTTGACTGAATTTT GAC AAT AAA GGT 
I D 

polyadenylation signal 

ACTATCGTTATGTAAAAAAAAAAAAAAAAA 
poly (A) tail 
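The amino acid rows printed under each nucleotide row follow from translating the open reading frame with the standard genetic code. The sketch below is illustrative only (the helper `translate` is not from the source); it translates the first nucleotide row of clone 2-2 starting at the ATG.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, start=0):
    """Translate dna from index `start`, stopping after the first stop codon ('*')."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


if __name__ == "__main__":
    # First nucleotide row of clone 2-2 as shown in the figure.
    first_row = "GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC"
    orf_start = first_row.find("ATG")
    print(translate(first_row, orf_start))
```

For this row the translation is MKLLLCFAFAA, the start of the signal sequence.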



Fig. 3.0



2-3 



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC
M K L L L C F A F A A



AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
R N K I S K E C Q Q V S G V S

CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKKTG 

GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
D T F K C I Y D S K P D F S P



ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
I D 

polyadenylation signal 

ACTATCGTTATGAAAAAAAAAAAAAAAAAA 
poly (A) tail 



Alignment of clones 2-2 and 2-3 (differing bases in brackets; start codon ATG at positions 14 to 16)

2-2  GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGC[G]
2-3  GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGC[T]

2-2  TTCGCCGCCATCGTCATCGGAGCTCAGGCTCTCACCG
2-3  TTCGCCGCCATCGTCATCGGAGCTCAGGCTCTCACCG

2-2  ACGAACAGATACAGAAAAGGAACAAGATCAGCAAAGA
2-3  ACGAACAGATACAGAAAAGGAACAAGATCAGCAAAGA

2-2  ATGCCAGCAGGTGTCCGGAGTGTCCCAAGAGACGATC
2-3  ATGCCAGCAGGTGTCCGGAGTGTCCCAAGAGACGATC

2-2  GACAAAGTCCGCACAGGTGTCTTGGTCGA[T]GATCCCA
2-3  GACAAAGTCCGCACAGGTGTCTTGGTCGA[C]GATCCCA

2-2  AAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAAC
2-3  AAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAAC

2-2  TGGAGTGGCAACCGAAGCCGGAGACACCAATGTGGAG
2-3  TGGAGTGGCAACCGAAGCCGGAGACACCAATGTGGAG

2-2  GTACTCAAAGCCAAGCTGAAGCATGTGGCCAGCGACG
2-3  GTACTCAAAGCCAAGCTGAAGCATGTGGCCAGCGACG

2-2  AAGA[G]GTGGACAAGATCGTGCAGAAGTGCGTGGTCAA
2-3  AAGA[A]GTGGACAAGATCGTGCAGAAGTGCGTGGTCAA

2-2  GAAGGCCACACCAGAGGAAACGGCTTATGACACCTTC
2-3  GAAGGCCACACCAGAGGAAACGGCTTATGACACCTTC

2-2  AAGTGTATTTACGACAG[C]AAACCTGATTTCTCTCCTA
2-3  AAGTGTATTTACGACAG[T]AAACCTGATTTCTCTCCTA

2-2  TTGATTAATTGTTTTGTATTTGACTGAATTTTGACAA
2-3  TTGATTAATTGTTTTGTATTTGACTGAATTTTGACAA

2-2  TAAAGGTA[A]TATCGTTATG[T]AAAAA
2-3  TAAAGGTA[C]TATCGTTATG[A]AAAAA



Predicted Amino Acid Composition of 2-2 and 2-3

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Amino Acid(s)          Number count   % by weight   % by frequency
Charged (RKHYCDE)      48             47.19         41.74
Acidic (DE)            20
Basic (KR)             20             20.40         17.39
Polar (NCQSTY)         30             25.35         26.09
Hydrophobic (AILFWV)   34             27.26         29.57
A Ala                  6              3.32          5.22
C Cys                  4              3.21          3.48
D Asp                  11             9.86          9.57
E Glu                  9              9.05          7.83
F Phe                  3              3.44          2.61
G Gly                  4              1.78          3.48
H His                  2              2.14          1.74
I Ile                  6              5.29          5.22
K Lys                  18             17.97         15.65
L Leu                  5              4.41          4.35
M Met                  1              1.02          0.87
N Asn                  2              1.78          1.74
P Pro                  4              3.02          3.48
Q Gln                  6              5.98          5.22
R Arg                  2              2.43          1.74
S Ser                  7              4.75          6.09
T Thr                  9              7.08          7.83
V Val                  14             10.80         12.17
W Trp                  0              0.00          0.00
Y Tyr                  2              2.54          1.74
B Asx                  0              0.00          0.00
Z Glx                  0              0.00          0.00
X Xxx                  0              0.00          0.00
. Ter                  0              0.00          0.00
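Several header rows of the composition table are simple functions of the molecular weight and the residue counts. The sketch below shows one way to reproduce them; it is not the analysis program used for the figure. The extinction coefficients per residue (5690 for Trp, 1280 for Tyr, 120 for Cys) are an assumption: the source does not state its method, but these commonly cited values reproduce the tabulated 3040 (2 Tyr, 4 Cys) and, for clone 3-4, 2920 (2 Tyr, 3 Cys). All function names are illustrative.

```python
def picomoles_per_microgram(molecular_weight):
    """pmol contained in 1 microgram of a protein of the given molecular weight (g/mol)."""
    return 1e6 / molecular_weight


def molar_extinction_280(n_trp, n_tyr, n_cys):
    """Estimated molar extinction coefficient at 280 nm (assumed per-residue coefficients)."""
    return 5690 * n_trp + 1280 * n_tyr + 120 * n_cys


def mg_per_ml_at_unit_a280(molecular_weight, extinction):
    """Protein concentration (mg/ml) that gives an absorbance of 1.0 at 280 nm, 1 cm path."""
    return molecular_weight / extinction


if __name__ == "__main__":
    mw = 12843.80                            # clones 2-2 and 2-3, from the table
    eps = molar_extinction_280(0, 2, 4)      # no Trp, 2 Tyr, 4 Cys
    print(round(picomoles_per_microgram(mw), 3))      # "1 microgram =" row
    print(eps)                                        # "Molar Extinction coefficient" row
    print(round(mg_per_ml_at_unit_a280(mw, eps), 2))  # "1 A(280) =" row
```

The same functions applied to the clone 3-4 values (molecular weight 12839.70, 3 Cys) give 77.883 pMoles, 2920 and 4.40 mg/ml.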



3.3 



A B C D E F MW 

41.8 
30.6 



I 



^^^^ 17.8 



6.9 



Lane 



1 2 3 4 5 




4.0 



Lane 1 2 3 4 5 6 

4000 bp— 



• ••if 



500 bp — 



Fig. 4.2



Lane 1 2 3 4 5 6 



4000 bp— 



1 



Lane 1 2 3 4 5 




Fig. 4.5



Tm 13.17 cDNA

Forward Primer
1    [5' portion not legible in the scan]ATGAAGTTGCTCTGTTGTCTAATCT
     MKLLCCLIS
61   CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
     LILLVTVQA ^ LTEAQIEKLNK
121  AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC
     ISKKCQNESGVSQEIITKAR
181  GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA
     NGDWEDDPKLKRQVFCVARN
241  ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA
     AGLATESGEVVVDVLREKVR
301  GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA
     KVTDNDEETEKIINKCAVKR
     Reverse Primer
361  GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
     DTVEETVFNTFKCVMKNKPK
421  AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
     FSPVD*
481  ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTCG
     polyadenylation signal (AATAAA); poly(A) tail (26)
537  AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC




Forward Primer 



LTDEQIQKRNKISKECQQVSGVSQE
LTEAQIEKLNKISKKCQNESGVSQE
LTEEDLQLLRQTSAECKTESGASEA
ETPREKLKQHSDACKAESGVSEE

TIDKVRTGVLV
IITKARNGDWE
KKARKGDLE
SLNKVRNREEV

2-2
Tm 13.17

2-2
Tm 13.17

DDPKMKKHVLCFSKKTGVATEAGDTNVEVLKAKLKH
DDPKLKRQVFCVARNAGLATESGEVVVDVLREKVRK
DDPKLKMQLLCIFKALEIVAESGEIEADTFKEKLTR
DDPKLKEHAFCILKRAGFIDASGEFQLDHIKTKFKE

Reverse Primer

VASDEEVDKIVQKCVVKK
VTDNDEETEKIINKCAVKR
VTNDDEESEKIVEKCTVTE
NSEHPEKVDDLVAKCAVKK

ATPEET
DTVEET
DTPEDT
DTPQHS

AYDTFKCIYDS
VFNTFKCVMKN
AFEVTKCVLKD
SADFFKCVHDN

2-2
Tm 13.17

KPDFSPID
KPKFSPVD
KPNFFGDLFV
RS



Primer     % composition                  Melting Temperature (°C)
           A      C      G      T
Forward    28.6   14.3   42.9   14.3      44.0
Reverse    25.0   31.3   6.3    37.5      44.0
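The primer table can be cross-checked from base counts alone. The primer sequences are not given in this section, so the counts below are an assumption: a 14-mer (4 A, 2 C, 6 G, 2 T) and a 16-mer (4 A, 5 C, 1 G, 6 T) are the smallest counts consistent with the tabulated percentages, taking the forward A value as 28.6 so that the row sums to 100. The source does not say how the melting temperature was calculated; the Wallace rule, Tm = 2(A+T) + 4(G+C), happens to give 44 for both sets of counts.

```python
from decimal import Decimal, ROUND_HALF_UP


def percent_composition(a, c, g, t):
    """Base composition in percent, rounded half-up to one decimal place."""
    total = a + c + g + t
    tenth = Decimal("0.1")
    return {
        base: float((Decimal(100 * n) / Decimal(total)).quantize(tenth, rounding=ROUND_HALF_UP))
        for base, n in (("A", a), ("C", c), ("G", g), ("T", t))
    }


def wallace_tm(a, c, g, t):
    """Wallace-rule melting temperature estimate (deg C) for a short oligonucleotide."""
    return 2 * (a + t) + 4 * (g + c)


# Hypothetical base counts inferred from the percentages; not stated in the source.
FORWARD = dict(a=4, c=2, g=6, t=2)
REVERSE = dict(a=4, c=5, g=1, t=6)

if __name__ == "__main__":
    for name, counts in (("Forward", FORWARD), ("Reverse", REVERSE)):
        print(name, percent_composition(**counts), wallace_tm(**counts))
```

Half-up rounding is used deliberately: Python's built-in `round` would turn 31.25 into 31.2 rather than the tabulated 31.3.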




Fig. 4.7





Fig. 4.8



Lane 1 2 3 4 5 




8000bp 
4000 bp 



3-4 



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC

ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
I V I G A Q A L T D E Q I Q K



AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 



CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT



CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA



GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 



AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG 



CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 



GACACCTTCAAGGTTATTTACGACAGTAAACCTGATTTCTCTCCT 



ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
I D 

polyadenylation signal 

ACTATCGTTATGTAAAAAAAAAAAAAAAAA



poly (A) tail 



Predicted Amino Acid Composition of 3-4

Molecular Weight                12839.70 m.w.
Length                          115
1 microgram =                   77.883 pMoles
Molar Extinction coefficient    2920±5%
1 A(280) =                      4.40 mg/ml
Isoelectric Point               7.14
Charge at pH 7                  0.16

Amino Acid(s)          Number count   % by weight   % by frequency
Charged (RKHYCDE)      47             46.41         40.87
Acidic (DE)            20             18.91         17.39
Basic (KR)             20             20.41         17.39
Polar (NCQSTY)         29             24.55         25.22
Hydrophobic (AILFWV)   35             28.04         30.43
A Ala                  6              3.32          5.22
C Cys                  3              2.41          2.61
D Asp                  11             9.86          9.57
E Glu                  9              9.05          7.83
F Phe                  3              3.44          2.61
G Gly                  4              1.78          3.48
H His                  2              2.14          1.74
I Ile                  6              5.29          5.22
K Lys                  18             17.97         15.65
L Leu                  5              4.41          4.35
M Met                  1              1.02          0.87
N Asn                  2              1.78          1.74
P Pro                  4              3.02          3.48
Q Gln                  6              5.99          5.22
R Arg                  2              2.43          1.74
S Ser                  7              4.75          6.09
T Thr                  9              7.09          7.83
V Val                  15             11.58         13.04
W Trp                  0              0.00          0.00
Y Tyr                  2              2.54          1.74
B Asx                  0              0.00          0.00
Z Glx                  0              0.00          0.00
X Xxx                  0              0.00          0.00
. Ter                  0              0.00          0.00



3-9 



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC

ATCGTCATCGGAGCTCAGGCTCTCACCGATGAACAGATACAGAAA
I V I G A Q A L T D E Q I Q K

AGGAACAAGATCAGCAAAGAATGCCAGCAGGAGTCCGGAGTGTCC 



CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
QETI DKVRTGVLVDD 

CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAGAACTGGA 
PKMKKHVLCFSKRTG 

GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 



AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 



CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY

GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 



ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
I D 

polyadenylation signal 

ACTATCGTTATGAAAAAAAAAAAAAAAAAA 
poly (A) tail 



Predicted Amino Acid Composition of 3-9

Analysis                        Whole Protein
Molecular Weight                12871.80 m.w.
Length                          115
1 microgram =                   77.689 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.23 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Amino Acid(s)          Number count   % by weight   % by frequency
Charged (RKHYCDE)      48             47.31         41.74
Acidic (DE)            20             18.86         17.39
Basic (KR)             20             20.57         17.39
Polar (NCQSTY)         30             25.29         26.09
Hydrophobic (AILFWV)   34             27.20         29.57
A Ala                  6              3.31          5.22
C Cys                  4              3.21          3.48
D Asp                  11             9.84          9.57
E Glu                  9              9.03          7.83
F Phe                  3              3.43          2.61
G Gly                  4              1.77          3.48
H His                  2              2.13          1.74
I Ile                  6              5.28          5.22
K Lys                  17             16.93         14.78
L Leu                  5              4.40          4.35
M Met                  1              1.02          0.87
N Asn                  2              1.77          1.74
P Pro                  4              3.02          3.48
Q Gln                  6              5.97          5.22
R Arg                  3              3.64          2.61
S Ser                  7              4.74          6.09
T Thr                  9              7.07          7.83
V Val                  14             10.78         12.17
W Trp                  0              0.00          0.00
Y Tyr                  2              2.54          1.74
B Asx                  0              0.00          0.00
Z Glx                  0              0.00          0.00
X Xxx                  0              0.00          0.00
. Ter                  0              0.00          0.00



7-5 



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC

ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
I V I G A Q A L T D E Q I Q K

AGGAACAAGATCAGCAAAGAGTGCCAGCAGGTGTCCGGAGTGTCC
R N K I S K E C Q Q V S G V S

CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 



CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 



GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 



AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 



CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 



GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 



ATTGATTAATTGTTTTGTATTTGGCTGAATTTTGAC AATAAA GGT

I D * 

polyadenylation signal 



ACTATCGTTATGTAAAAAAAAAAAAAAAAA 
poly (A) tail 



Predicted Amino Acid Composition of 7-5

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Amino Acid(s)          Number count   % by weight   % by frequency
Charged (RKHYCDE)      48             47.19         41.74
Acidic (DE)            20             18.90         17.39
Basic (KR)             20             20.40         17.39
Polar (NCQSTY)         30             25.35         26.09
Hydrophobic (AILFWV)   34             27.26         29.57
A Ala                  6              3.32          5.22
C Cys                  4              3.21          3.48
D Asp                  11             9.86          9.57
E Glu                  9              9.05          7.83
F Phe                  3              3.44          2.61
G Gly                  4              1.78          3.48
H His                  2              2.14          1.74
I Ile                  6              5.29          5.22
K Lys                  18             17.97         15.65
L Leu                  5              4.41          4.35
M Met                  1              1.02          0.87
N Asn                  2              1.78          1.74
P Pro                  4              3.02          3.48
Q Gln                  6              5.98          5.22
R Arg                  2              2.43          1.74
S Ser                  7              4.75          6.09
T Thr                  9              7.08          7.83
V Val                  14             10.80         12.17
W Trp                  0              0.00          0.00
Y Tyr                  2              2.54          1.74
B Asx                  0              0.00          0.00
Z Glx                  0              0.00          0.00
X Xxx                  0              0.00          0.00
. Ter                  0              0.00          0.00




NUCLEOTIDE SEQUENCES

PERCENT SIMILARITY (upper right of the diagonal); cells marked ? are illegible in the scan

      1      2      3      4      5      6      7      8      9      10
1     -      99.1   97.3   98.4   98.2   50.4   42     42.4   20.7   20.9   1    2-2
2     0.9    -      99     99     99     51.5   43.2   42.7   20.2   21.3   2    2-3
3     1.6    ?      -      98     98     50.3   42     42.1   20     20     3    3-4
4     1.6    ?      80     -      98.4   51.3   43.7   43.6   20.4   22     4    3-9
5     1.8    25     75     45     -      50.4   42.5   42.9   22.2   23.1   5    7-5
6     36.9   36.4   37.2   36.4   37.1   -      57.2   37.4   23.3   21     6    Tm 13.17
7     39.9   41.2   41.7   40.7   41.2   30.2   -      39.3   21.9   22.1   7    B1
8     41.8   42.4   43.1   42.3   42.3   45.3   49.5   -      21.8   23.1   8    AFP-3
9     48.7   58.6   52     51.2   51.9   61.7   62.2   58.?   -      45.6   9    YL-1
10    51.4   62.7   46.4   46.5   46.5   60     58.1   61.8   35.5   -      10   DAFP-1A
      1      2      3      4      5      6      7      8      9      10







AMINO ACID SEQUENCES

PERCENT SIMILARITY (upper right of the diagonal); cells marked ? are illegible in the scan

      1      2      3      4      5      6      7      8      9      10
1     -      100    99.1   99.1   100    51.3   37.4   35.2   11.6   12     1    2-2
2     0      -      99.1   99.1   100    51.3   37.4   35.2   11.6   12     2    2-3
3     0.9    0.9    -      98.3   99.1   50.4   36.5   34.3   11.6   12     3    3-4
4     0.9    0.9    1.7    -      99.1   51.3   37.4   36.1   10.7   12     4    3-9
5     0      0      0.9    0.9    -      51.3   37.4   35.2   11.6   12     5    7-5
6     46.1   46.1   47     46.1   46.1   -      47.4   39.8   13.4   13.9   6    Tm 13.17
7     59.1   59.1   60     59.1   59.1   51.7   -      37     11.6   11.1   7    B1
8     61.7   61.7   62.6   60.7   61.7   60.2   63     -      10.2   8.3    8    AFP-3
9     86.7   86.7   85.7   85.7   85.7   86.8   84.2   87.5   -      55.6   9    YL-1
10    88.3   88.3   86.4   84.5   85.4   89.1   82.7   90.5   40.?   -      10   DAFP-1A
      1      2      3      4      5      6      7      8      9      10







His-tagged clone 2-2 with signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His -tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met 
-30 -25 -20 

AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC ATC GTC ATC GGA GCT 231 
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala He Val He Gly Ala 



N-terminal of mature AFP 
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276 
Gin Ala Leu Thr Asp Glu Gin He Gin Lys Arg Asn Lys He Ser 

1 5 10

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321 
Lys Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr He Asp 
15 20 25 

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366 
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys He Val Gin Lys Cys Val Val 
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 
He Tyr Asp Ser Lys Pro Asp Phe Ser Pro He Asp * 
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGAC AAT AAA GGTAATA TCGTTATGTA AAAAAAAAAA 645 

AAAAAACTCG AGCACCACCA CCACCACCAC TGAGAT 681 



Fig. 5.7



His-tagged clone 2-2 without signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin He Gin 
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231 
Lys Arg Asn Lys He Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276 
Ser Gin Glu Thr He Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys He 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro He Asp * 
115 



Fig. 5.8



His-tagged clone 2-3 with signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His -tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 

Met Gly Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 



AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met 



AAA CTC CTC TTG TGC TTT GCT TTC GCC GCC ATC GTC ATC GGA GCT 
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala He Val He Gly Ala 



N -terminal of Mature AFP 
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276 
Gin Ala Leu Thr Asp Glu Gin He Gin Lys Arg Asn Lys He Ser 

1 5 10 

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321 
Lys Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr He Asp 
15 20 25 

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366 
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAA GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys He Val Gin Lys Cys Val Val 
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 
He Tyr Asp Ser Lys Pro Asp Phe Ser Pro He Asp * 
105 110 115 

Polyadenylation signal Poly-A tail
TTTGACTGAA TTTTGAC AAT AAA GGTACTA TCGTTATGAA AAAAAAAAAA 645 

AAAAAAACTC GAGCACCACC ACCACCACCA CTGAGAT 682



Fig. 5.9



His-tagged clone 2-3 without signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His -tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin He Gin 
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231 
Lys Arg Asn Lys He Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276 
Ser Gin Glu Thr He Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAA GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys He 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro He Asp * 
115 



Fig. 5.10



His-tagged Tm 13.17 with signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His-tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 

Met Gly Ser Ser His His His His His His Ser 

-65 -60 -55 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-50 -45 -40 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Trp He Gin Arg He 



AFP Start Codon 
CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 
Arg His Glu Thr Thr Lys Met Lys Leu Leu Cys Cys Leu Ile Ser



N-terminal of mature AFP
CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 
Leu He Leu Leu Val Thr Val Gin Ala Leu Thr Glu Ala Gin He 



GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 
Glu Lys Leu Asn Lys He Ser Lys Lys Cys Gin Asn Glu Ser Gly 



GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 
Val Ser Gin Glu He He Thr Lys Ala Arg Asn Gly Asp Trp Glu 



GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 
Asp Asp Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn 



GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 
Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu 



AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 
Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu 



AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 
Lys He He Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu 



ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG 
Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys 



Stop Codon 

TTC TCA CCA GTT GAT TGA ACCACCACGA CTAGTAGATG GTTCAAATGG 
Phe Ser Pro Val Asp * 



Polyadenylation signal Poly-A tail 
TGTGCTTTAC ATATAA AAAT AAA GTGTTTC TGATGTAAAA AAAAAAAAAA

AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 

TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 



His-tagged Tm 13.17 without signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGC CTG ACC GAG GCA CAA ATT GAG AAA 186
Gly Gln Gln Met Gly Arg Gly Leu Thr Glu Ala Gln Ile Glu Lys
-5 1 5 

CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA GTG TCG 231 
Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly Val Ser
10 15 20 

CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG GAC GAT 276 
Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp
25 30 35 

CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC GCC GGT 321 
Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn Ala Gly
40 45 50 

CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG AGG GAG 366 
Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu 
55 60 65 

AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG AAA ATC 411 
Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys Ile
70 75 80 

ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG ACG GTG 456 
Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val
85 90 95 

TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG TTC TCA 501 
Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser 
100 105 110 

Stop Codon 

CCA GTT GAT TGA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro Val Asp * 
115 
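
The codon-by-codon translation shown in the two listings above can be checked mechanically. The sketch below uses the standard genetic code, built from the usual 64-character table in TCAG order; the function name `translate` and the two sample segments (copied from the listing) are illustrative choices, not part of the original analysis.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string (spaces allowed) in frame 1, stopping at '*'."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)

# Segments copied from the listing: the first eight codons of the mature AFP
# and the last five codons plus the stop codon.
mature_start = "CTG ACC GAG GCA CAA ATT GAG AAA"
c_terminus = "TTC TCA CCA GTT GAT TGA"

print(translate(mature_start))  # LTEAQIEK
print(translate(c_terminus))    # FSPVD*
```

Running the same function over a full construct gives a quick check that the His-tag leader and the AFP coding region are in the same reading frame.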
(-- marks an entry that is not legible in the source)

Code  Amino acid                   Abbr.  Side chain                Hydropathy
A     Alanine                      Ala    --                        --
B     Aspartic acid or asparagine  Asx    --                        --
C     Cysteine                     Cys    --                        --
D     Aspartic acid                Asp    --                        --
E     Glutamic acid                Glu    --                        --
F     Phenylalanine                Phe    --                        highly hydrophobic
G     Glycine                      Gly    --                        --
H     Histidine                    His    basic imidazole           highly hydrophilic
I     Isoleucine                   Ile    aliphatic                 --
K     Lysine                       Lys    --                        highly hydrophilic
L     Leucine                      Leu    aliphatic                 hydrophobic
M     Methionine                   Met    --                        --
N     Asparagine                   Asn    amide, acidic derived     hydrophilic
O     --
P     Proline                      Pro    aliphatic, cyclic, imino  mod. hydrophilic
Q     Glutamine                    Gln    amide, acidic derived     hydrophilic
R     Arginine                     Arg    basic                     highly hydrophilic
S     Serine                       Ser    aliphatic hydroxyl        hydrophilic
T     Threonine                    Thr    aliphatic hydroxyl        hydrophilic
U     --
V     Valine                       Val    aliphatic                 hydrophobic
W     Tryptophan                   Trp    --                        highly hydrophobic
X     --
Y     Tyrosine                     Tyr    --                        mod. hydrophilic
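
The sequence listings in this section use three-letter abbreviations, while the cDNA figures use one-letter codes. A minimal sketch of the conversion between the two follows; the dictionary covers the twenty standard residues plus the stop symbol, and the names `THREE_LETTER` and `to_three_letter` are illustrative.

```python
# One-letter to three-letter amino acid abbreviations (standard residues only).
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
    "*": "*",  # stop codon marker, kept as printed in the listings
}

def to_three_letter(one_letter_seq):
    """Convert a one-letter protein string to space-separated three-letter codes."""
    return " ".join(
        THREE_LETTER[residue]
        for residue in one_letter_seq.upper()
        if not residue.isspace()
    )

print(to_three_letter("MKLLCCLIS"))  # Met Lys Leu Leu Cys Cys Leu Ile Ser
print(to_three_letter("FSPVD*"))     # Phe Ser Pro Val Asp *
```

A residue outside the dictionary (for example B or X) raises `KeyError`, which is a deliberate choice here so that ambiguous codes are not silently passed through.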
[Figures: axis labels recoverable from the source are NaCl concentration (0.9% NaCl, 1.8% NaCl), log(Tm 12.86 concentration), log(dilution), and thermal hysteresis (°C); annotations: -6 °C, 0.9% NaCl baseline; figure numbers 8.23 and 8.43.]
"Sandwich" method of R.I. assessment

1.

3. FREEZE ON -80 °C ALUMINUM PLATE (~10 MIN.)

4. PLACE ON COLD STAGE, ANNEAL AT -6 °C UP TO 12+ HOURS
DNA sequence of Tm 13.17 cDNA clone

  1 TCTGGATCCAAAGAATTCGGCACGAGACTACTAAGATGAAGTTGCTCTGTTGTCTAATCT
    MKLLCCLIS

 61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
    LILLVTVQALTEAQIEKLNK

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC
    ISKKCQNESGVSQEIITKAR

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA
    NGDWEDDPKLKRQVFCVARN

241 ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA
    AGLATESGEVVVDVLREKVR

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA
    KVTDNDEETEKIINKCAVKR

361 GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
    DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
    FSPVD*

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTCG
    polyadenylation signal (AATAAA); poly (A) tail (26)

537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC
Fig. 8.43



GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC

ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
IVIGAQALTDEQIQK

AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
RNKISKECQQVSGVS

CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT
QETIDKVRTGVLVDD

CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKKTG

GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC

AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG
KLKHVASDEEVDKIV

CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT
DTFKCIYDSKPDFSP

ATTGATTAATTGTTTTGTATTTGA?TGAATTTTGACAATAAAGGT
ID*

polyadenylation signal (AATAAA)

ACTATCGTTATGTAAAAAAAAAAAAAAAAA
poly (A) tail



